ARCHIVAS 001 



PATENT 



IN THE SPECIFICATION 

Pursuant to 37 CFR § 1.121(b)(l)(i)-(ii), please delete the paragraph beginning on 
page 4, line 16 and continuing through line 29, and replace it with the following 
paragraph, which includes markings to show all the changes relative to the previous 
version of the paragraph: 

"According to the present invention, an N+K coding technique is described for 
use to protect data that is being distributed in a redundant array of independent nodes 
(RAIN). The data itself may be of any type, and it may also include system metadata. 
According to the invention, the data to be distributed is encoded by a dispersal operation 
that uses a group of permutation ring operators. In a preferred embodiment, the dispersal 
operation is effected using a matrix of the form [In_C] where In is an n x n identity sub- 
matrix and C is a k X n sub-matrix of code blocks. The identity sub-matrix is used to 
preserve the original data. The sub-matrix C preferably comprises a set of permutation 
ring operators that are used to generate the code blocks. The operators are preferably 
"polynomials" that are selected from a group ring of a permutation group with base ring 
Z2 , e.g., a set of permutations whose action on the data is taken to be the XOR of the 
actions of the individual permutations The i^^ code block is computed as: €4-f(gil( A4. 
)r^^^-^gi«(Aft)) Ci = f (gn ( Aj ^ . . . gin(Anl) . where f() is preferably addition mod 2 (i.e., 
XOR), and g( ) is a permutation operator as described above. Each code block is 
preferably stored on a separate node." 

Pursuant to 37 CFR § 1.121(b)(l)(i)-(ii), please delete the paragraph beginning on 
page 4, line 30 and continuing through page 5, line 10, and replace it with the following 
paragraph, which includes markings to show all the changes relative to the previous 
version of the paragraph: 

"In a more specific embodiment, an N+K (4,2) coding scheme is implemented. 
In this case, a file to be archived comprises four (4) data blocks (Al, A2, A3, A4). A 
dispersal matrix comprises six (6) code blocks (CO, CI, C2, C3, C4, C5). Because the 
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identity sub-matrix is used, however, the first four code blocks (CO, CI, C2, C3) are 
just copies of the first four data blocks, and these data blocks Safe are then stored in 
four distinct nodes of the array. The sub-matrix C is then generated as follows. 
Assume that g is a permutation operator that comprises a polynomial of cyclic 
permutations, such as: bo * c^ + bi * c^ + bkC^+. . . b (m-i) * c ^'^'^\ where bk is a bit (0 
or 1), c^ is the identity ("do nothing to the data"), and c^ is a cycle operation c repeated 
k times, e.g., the operation: "cycle the data k words." The i^^ code block is then 
computed as: € |-f(fel(A^), ...gin(A» ^ Q = f (g n( A j), ...gin(An)) . The C4 code 
block is then stored in the 5^*^ node, and the C5 code block is stored in the 6^*" node to 
complete the encoding process." 

Pursuant to 37 CFR § 1.121(b)(l)(i)-(ii), please delete the paragraph beginning on 
page 9, line 22 and continuing through page 10, line 6, and replace it with the following 
paragraph, which includes markings to show all the changes relative to the previous 
version of the paragraph: 

"Any scheme that reduces r from (c-l)(n-l) of the "copying" scheme involves 
using code blocks that somehow mix the data. In general, any such mixing can be 
thought of in terms of a matrix product such as illustrated in Figure 2. In this example, 
G is a r by « matrix, and ^ is an « column vector (the data blocks). This matrix 
product produces t (= n+k+r) code blocks. The i^^ code block is computed as: 
Q - f (&1( A^ -^h-^rrSiMA^^ Ci = f (gii( Ai . . .g^An )). Here,theg'scanbeany 
functions acting on the code blocks, and/a function that acts on these intermediate 
results to mix them, producing the Ci elements. If this were a normal matrix 
computation over integers, the action of gij is a multiplication and the action of/is an 
addition. There is no reason, however, that these operations must be normal 
multiplications and additions. If one considers that the individual blocks are usually 
themselves long strings of information that may be mixed by the operators gij, each of 
the gij can itself be considered as a matrix, operating at a finer resolution. Similariy, 
one may move from any (non-block) erasure code to a block erasure code just by 
grouping the fine scale operations into blocks. Of course, designing the code to take 
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advantage of the block nature of the problem may produce computational savings. 
Decoding proceeds by inversion in the usual manner." 

Pursuant to 37 CFR § L121(b)(l)(i)-(ii), please delete the paragraph beginning on 
page 10, line 4 and continuing through line 17, and replace it with the following 
paragraph, which includes markings to show all the changes relative to the previous 
version of the paragraph: 

"Thus, according to the present invention, an NK coder is described for use to 
protect data that is being distributed in an RAIN archive. The data itself may be of 
any type, and it may also include the archive metadata. According to the invention, 
the data to be distributed is encoded by a matrix operation that uses an identity sub- 
matrix to preserve the data words, and that uses permutation ring operators to generate 
the code words. The operators are preferably polynomials that are selected from a 
group ring of a permutation group with base ring Z2. The i^^ code block is computed 
as: G i'f (&1( A4 ^r^TTfeft(A»)) Ci = f(g n( Ai ), ...g aAJV where f( ) is preferably 
addition mod 2 (i.e., XOR), and g( ) is a permutation operator, such as a polynomial of 
cyclic permutations. Illustrative operators include, for example, 1 = s^ ("do nothing"), 
s" ("shift right n words"), 1+s" (XOR, unshifted image with shifted n), and so forth. 
With these operators, (1+s) (aia2a3) = (ai+a3)(a2+ai)(a3+a2). The invention is desirable 
as most operators are very fast. Where matrices are not invertible, the de-convolve 
operation can be used, i.e., given a first word al, decode (l+s)(A) = 
(((ai4-a3)(a2+ai)(a3+a2). A de-convolution example is shown in Figure 3." 
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